UNIT 04 - Data
EXPECTATIONS:

LEARNING GOALS -- At the conclusion of this unit, I will have the following "ESSENTIAL KNOWLEDGE":

A) Identify Challenges Associated With Processing Data:

    1. The ability to process data depends on the capabilities of the users and their tools.

    2. Data sets pose challenges regardless of size, such as:

      § the need to clean data
      § incomplete data
      § invalid data
      § the need to combine data sources

    3. Depending on how data were collected, they may not be uniform. For example, if users enter data into an open field, the way they choose to abbreviate, spell, or capitalize something may vary from user to user.

    4. Cleaning data is a process that makes the data uniform without changing their meaning (e.g., replacing all equivalent abbreviations, spellings, and capitalizations with the same word).

    5. Problems of bias are often created by the type or source of data being collected. Bias is not eliminated by simply collecting more data.

    6. The size of a data set affects the amount of information that can be extracted from it.

    7. Large data sets are difficult to process using a single computer and may require parallel systems.

    8. Scalability of systems is an important consideration when working with data sets, as the computational capacity of a system affects how data sets can be processed and stored.
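
As an illustration of items 3 and 4 above, here is a minimal Python sketch of cleaning open-field entries. The lookup table CANONICAL, the helper names clean_entry and clean_data, and the sample entries are all hypothetical and chosen only for this example; real data would need a table built from the variants actually observed.

```python
# Hypothetical lookup table: every known variant maps to one canonical spelling.
CANONICAL = {
    "ny": "New York",
    "n.y.": "New York",
    "new york": "New York",
    "ca": "California",
    "calif.": "California",
    "california": "California",
}


def clean_entry(raw):
    """Return the canonical form of one open-field entry, or None if unknown."""
    key = raw.strip().lower()  # ignore surrounding spaces and capitalization
    return CANONICAL.get(key)


def clean_data(entries):
    """Split entries into cleaned values and entries that could not be cleaned."""
    cleaned, rejected = [], []
    for raw in entries:
        value = clean_entry(raw)
        if value is None:
            rejected.append(raw)  # incomplete or invalid data, kept for review
        else:
            cleaned.append(value)
    return cleaned, rejected


# Sample (made-up) entries typed by different users.
entries = ["NY", " new york ", "Calif.", "", "N.Y.", "Kalifornia"]
cleaned, rejected = clean_data(entries)
print(cleaned)   # ['New York', 'New York', 'California', 'New York']
print(rejected)  # ['', 'Kalifornia']
```

The sketch makes the data uniform without changing their meaning: equivalent spellings collapse to one word, while empty or unrecognized entries are set aside rather than guessed at.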

B) Extract Information From Data Using a Program:

  1. Programs can be used to process data to acquire information.

  2. Tables, diagrams, text, and other visual tools can be used to communicate insight and knowledge gained from data.

  3. Search tools are useful for efficiently finding information.

  4. Data filtering systems are important tools for finding information and recognizing patterns in data.

  5. Programs such as spreadsheets help efficiently organize and find trends in information.

  6. Some processes that can be used to extract or modify information from data include the following:

    § transforming every element of a data set, such as doubling every element in a list, or adding a parent’s email to every student record

    § filtering a data set, such as keeping only the positive numbers from a list, or keeping only students who signed up for band from a record of all the students

    § combining or comparing data in some way, such as adding up a list of numbers, or finding the student who has the highest GPA

    § visualizing a data set through a chart, graph, or other visual representation
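
The four processes listed above can be sketched in Python as follows. This is only one possible way to write them; the sample numbers, the student records, and the example.com email pattern are invented for illustration.

```python
# Made-up sample data for illustration only.
numbers = [3, -1, 4, -5, 9]
students = [
    {"name": "Ana", "gpa": 3.8, "band": True},
    {"name": "Ben", "gpa": 3.2, "band": False},
    {"name": "Cy", "gpa": 3.9, "band": True},
]

# Transforming: apply the same change to every element.
doubled = [n * 2 for n in numbers]
with_email = [
    {**s, "parent_email": s["name"].lower() + ".parent@example.com"}
    for s in students
]

# Filtering: keep only the elements that meet a condition.
positives = [n for n in numbers if n > 0]
band_members = [s["name"] for s in students if s["band"]]

# Combining or comparing: reduce many elements to a single result.
total = sum(numbers)
top_student = max(students, key=lambda s: s["gpa"])

# Visualizing: a very simple text bar chart of band sign-ups.
in_band = len(band_members)
not_in_band = len(students) - in_band
chart = [
    "band     " + "#" * in_band,
    "no band  " + "#" * not_in_band,
]

print(doubled)              # [6, -2, 8, -10, 18]
print(positives)            # [3, 4, 9]
print(total)                # 10
print(band_members)         # ['Ana', 'Cy']
print(top_student["name"])  # Cy
print("\n".join(chart))
```

A spreadsheet offers the same four operations through formulas, filters, summary functions, and charts; the program version simply makes each step explicit.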

C) Explain How Programs Can Be Used to Gain Insight and Knowledge From Data:

  1. Programs are used in an iterative and interactive way when processing information to allow users to gain insight and knowledge about data.

  2. Programmers can use programs to filter and clean digital data, thereby gaining insight and knowledge.

  3. Combining data sources, clustering data, and classifying data are parts of the process of using programs to gain insight and knowledge from data.

  4. Insight and knowledge can be obtained from translating and transforming digitally represented information.

  5. Patterns can emerge when data are transformed using programs.
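
As a rough illustration of combining data sources and classifying data, the Python sketch below joins two small sources on a shared student ID and then labels each record. Everything in it is an assumption made for the example: the two dictionaries, the helper names combine and classify, and the GPA cutoffs of 3.5 and 3.0.

```python
from collections import Counter

# Two hypothetical data sources keyed by student ID.
names = {101: "Ana", 102: "Ben", 103: "Cy", 104: "Dee"}
gpas = {101: 3.8, 102: 2.4, 103: 3.1, 105: 3.5}


def combine(names, gpas):
    """Join the two sources on IDs present in both; other IDs are skipped."""
    shared_ids = sorted(names.keys() & gpas.keys())
    return [{"id": sid, "name": names[sid], "gpa": gpas[sid]} for sid in shared_ids]


def classify(gpa):
    """Assign a label using assumed cutoffs (3.5 and 3.0)."""
    if gpa >= 3.5:
        return "high"
    if gpa >= 3.0:
        return "middle"
    return "low"


records = combine(names, gpas)
for record in records:
    record["group"] = classify(record["gpa"])

# Counting the labels is one simple way a pattern can become visible.
counts = Counter(record["group"] for record in records)

for record in records:
    print(record["id"], record["name"], record["gpa"], record["group"])
print(dict(counts))
```

IDs 104 and 105 appear in only one source, so they drop out of the combined result. This is an example of the incomplete data that often surfaces when sources are combined, and rerunning the program with different cutoffs is the kind of iterative use described in item 1.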

TERMS & CONCEPTS:

Term Description
   

Little Bo Peep Has Lost Her Sheep
and Radar Cannot Find Them
They'll all (face to face)
Meet in parallel Space
Preceding Their Leaders Behind them